Determining Hit Rate in Pattern Search
Abstract
The problem of spurious apparent patterns arising by chance is a fundamental one for pattern detection. Classical approaches, based on adjustments such as the Bonferroni procedure, are arguably not appropriate in a data mining context. Instead, methods based on the false discovery rate (the proportion of flagged patterns which do not represent an underlying reality) may be more relevant. We describe such procedures and illustrate their application on a marketing dataset.
Similar resources
Heparin induced thrombocytopenia
Background and Objectives: Heparin is still a commonly used anticoagulant in prophylaxis and treatment of thromboembolic events. Heparin-induced thrombocytopenia (HIT) is a life-threatening adverse drug reaction of heparin. The diagnosis of HIT is based on two important criteria: first, clinical evaluation, and second, laboratory testing. In this comprehensive review, the authors w...
Journal of Clinical and Diagnostic Research
Various algorithms are in use in medical processes to improve the speed, sensitivity and accuracy of the computations and analyses involved in those experiments. The aim of this paper is to suggest three improvements, namely Multi Hit, Dropoff percentage and NCM-2 in the BLAST algorithm. BLAST (Basic Local Alignment Search Tool) is a popular tool used for determining the patterns in genomic seq...
Evaluation of Ontology-based User Interests Modeling
Deriving users’ interests from their online searching and browsing behaviors is an important research direction with several applications in content search and management. Manually edited Web directories, such as Open Directory Project (ODP) or Yahoo! directory, provide an ontology of concepts (categories) along with pages relevant to those categories. Aiming to evaluate and compare the performa...
Aiming strategy error analysis and verification of a billiard training system
A low cost training system is proposed for regular billiard game tutoring. We describe the elements to construct an interactive computer system which helps train billiard players in enhancing their skills. Most research on computer billiard has focused on creating highly competitive billiard playing programs, based on various search algorithms. Game playing strategies are embedded into these pr...
A machine learning approach for result caching in web search engines
A commonly used technique for improving search engine performance is result caching. In result caching, precomputed results (e.g., URLs and snippets of best matching pages) of certain queries are stored in a fast-access storage. The future occurrences of a query whose results are already stored in the cache can be directly served by the result cache, eliminating the need to process the query us...
Publication date: 2002